CDS

Accession Number TCMCG017C29309
gbkey CDS
Protein Id OMO64669.1
Location join(45768..45834,46205..46264,46392..46463,46600..46698,47099..47219,47293..47370,47526..47559,47632..47700,47830..47904)
GeneID InterPro:IPR005574
Organism Corchorus olitorius
locus_tag COLO4_31953
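
The Location field uses the GenBank-style `join(...)` notation, where each `start..end` pair is a 1-based, inclusive exon segment. As a rough consistency check, the segment lengths can be summed and compared with the protein length reported in this record. The sketch below is illustrative only; the helper name `parse_join` is hypothetical and not part of any database tooling, and it assumes the final codon is a stop codon.

```python
import re

def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs (1-based, inclusive) from a join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

# Location string copied from the record above.
location = (
    "join(45768..45834,46205..46264,46392..46463,46600..46698,"
    "47099..47219,47293..47370,47526..47559,47632..47700,47830..47904)"
)

segments = parse_join(location)

# Inclusive coordinates, so each segment spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)

# Assumption: the last codon is a stop codon and is not counted as a residue.
residues = cds_length // 3 - 1

print(len(segments), cds_length, residues)  # 9 675 224
```

Nine segments totalling 675 nt give 225 codons, which is consistent with the 224 aa length listed under Protein once the stop codon is excluded.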

Protein

Length 224aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA215141, BioSample:SAMN03160584
db_source AWUE01020885.1
Definition RNA polymerase II, Rpb4 [Corchorus olitorius]
Locus_tag COLO4_31953

EGGNOG-MAPPER Annotation

COG_category K
Description DNA-directed RNA polymerases IV and V subunit
KEGG_TC -
KEGG_Module M00180
KEGG_Reaction R00435, R00441, R00442, R00443
KEGG_rclass RC02795
BRITE br01611, ko00000, ko00001, ko00002, ko03021, ko03400
KEGG_ko ko:K03012
EC -
KEGG_Pathway ko00230, ko00240, ko01100, ko03020, ko05016, ko05169, map00230, map00240, map01100, map03020, map05016, map05169
GOs -

Sequence

CDS:  
ATGTCGGAGAAGGGAGGCAAAGGGTTTTCATTGCCCAAATCTTCTCTCAAATCTACCGCTAATAGTGGAAAAGATGATGATTCAGCAAAATCAAAGAGGGGAAGGAAAGTGCAGTTTGGAGCTGAAGGTTCACCAGATCTTAACTTTAGTTTCTCGTCGCGAAAATCAGATGGCAAGTTTGCTACCCCTATTGGTAAAGGTGACTGGGCTAAGGGAGGAAAAGGAGAAAAGGCAGTCAATGGTGGAAAAACTTCTGTGGCAAAAGAAGCAAAGTCATTGGAGCTCAGAGTTGAACAGGAACTTCAAGGTAATCAAGGTACTGTTAAATGCATCATGGATTGTGAGGCTGCAAATATCTTAGAAGGAATCCAGGAACAAATGGTTATGCTCTCTGAAGACTCAACTATTAAGCTCCCCGAATCATTTAATATGGGACTGCAGTATGCCAAGACCGGTAGCTATTATACTAATCCCCAGTCTGTCAGACAAGTTCTTGAGACTCTTTCAAAATATGGTGTCACTGACAGCGAGATTTGTGTGATTGGAAATGCTTGTCCTGAAACTACCGATGAAGTTTTTGCTCTTGTGCGATCCTTGGAGGCCAAGAGAAGCAGGCTCAGTGAACCACTTAAAGATGTATTGGATGAGCTAGCTAAACTGAAACAATCCAGCTGA
Protein:  
MSEKGGKGFSLPKSSLKSTANSGKDDDSAKSKRGRKVQFGAEGSPDLNFSFSSRKSDGKFATPIGKGDWAKGGKGEKAVNGGKTSVAKEAKSLELRVEQELQGNQGTVKCIMDCEAANILEGIQEQMVMLSEDSTIKLPESFNMGLQYAKTGSYYTNPQSVRQVLETLSKYGVTDSEICVIGNACPETTDEVFALVRSLEAKRSRLSEPLKDVLDELAKLKQSS
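
The relationship between the CDS and Protein strings above can be illustrated with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and the `translate` helper is a hypothetical name used only for illustration. Only short fragments of the CDS are translated here; the full sequence can be passed in the same way.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nt and last 15 nt of the CDS shown above.
print(translate("ATGTCGGAGAAGGGAGGCAAAGGG"))  # MSEKGGKG
print(translate("AAACAATCCAGCTGA"))           # KQSS
```

Both fragments agree with the start (`MSEKGGKG`) and end (`KQSS`) of the listed protein sequence.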